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A  NEW  PROOF  OF  ADMISSIBILITY 


OF  TESTS  IN  THE  MULTIVARIATE  ANALYSIS  OF  VARIANCE 

T,  W.  Anderson  and  Akimichi  Takemura 


1.  Introduction. 

A  general  theorem  on  the  admissibility  of  tests  of  the  general  multi¬ 
variate  linear  hypothesis  was  proved  by  Schwartz  (1967)  and  Ghosh  (1964) 
using  Stein's  theorem  (19S6).  In  using  Stein's  theorem  there  are  two 
conditions  to  prove:  (i)  convexity  of  the  acceptance  region  and  (ii) 

existence  of  certain  alternative  hypotheses.  In  his  paper  on  the  admissi- 
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bility  of  Hotelling's  T  -test  Stein  (1956)  proved  the  convexity  of  the 
acceptance  region  by  showing  that  it  is  an  intersection  of  half-spaces. 
Schwartz  (1967)  and  Ghosh  (1964)  followed  this  approach. 

The  purpose  of  this  paper  is  to  present  a  new  proof  of  the  admissi¬ 
bility  of  the  tests  of  the  general  multivariate  linear  hypothesis.  We  show 
the  convexity  of  the  acceptance  region  more  directly;  that  is,  a  convex 
combination  of  two  sample  points  in  the  acceptance  region  again  belongs  to 
the  region.  The  separation  of  conditions  (i)  and  (ii)  simplifies  consider¬ 
ably  the  proof  of  the  convexity  condition  (i)  and  makes  its  geometrical 
meaning  clearer.  We  shall  be  explicit  also  in  proving  the  condition  (ii). 

In  Section  2  we  state  the  admissibility  results  in  several  forms  and 
discuss  their  relations.  In  Section  3  Stein's  theorem  on  the  admissibility 
of  tests  in  the  general  exponential  family  framework  is  stated.  The  rest 
of  the  paper  is  devoted  to  the  proof  of  the  theorems  in  Section  2. 


2.  The  Problem  and  Main  Results. 


In  this  section  we  set  up  the  problem  and  state  the  admissibility 
results  in  several  forms.  Discussion  of  the  relations  of  those  forms 
will  be  given.  For  proofs  see  Section  4. 

The  problem  of  testing  the  general  multivariate  linear  hypothesis  can 
be  written  in  the  following  canonical  form.  Let  X  (p  *  m) ,  Y  (pxr),  and 
Z  (p x n)  be  random  matrices  such  that  their  columns  are  independently 
normally  distributed  with  a  common  covariance  matrix  Z  and  means 

(2.1)  ex  =  m,  ey  =  n,  ez  =  o  , 

respectively.  (See  Anderson  (1958),  sec.  8.11.)  The  null  hypothesis  is 
H^:  M  =  0  and  the  alternative  hypothesis  is  M  ^  0.  The  parameter  space 
ft  is  given  by 

(2.2)  ft  =  { (M,N,Z) |  I  positive  definite}  . 

The  null  hypothesis  is 

(2.3)  ft^  =  { (M,N,Z) |  M  =  0,  Z  positive  definite}  . 

A  test  T*  of  the  null  hypothesis  H^:  to  e  ftp  is  said  to  be  admissible 
if  there  exists  no  other  test  T  such  that 

PrtReject  Hq|  T,to}  $  Pr{Reject  Hq|  T*,to},  to  e  ft^  , 

(2.4) 

Pr{Reject  HQ|  T,oj}  i  Pr{Reject  HQ|  T*,w},  to  e  ft  -  ftQ  , 
with  strict  inequality  for  at  least  one  to. 


The  usual  tests  of  the  above  null  hypothesis  can  be  given  in  terms  of 


the  (nonzero)  roots  of  the  following  determinantal  equation: 


(2.5) 

where 


| XX'  -  A(ZZ'  +  XX')  |  =  | XX*  -  A(U  -  YY')|  =  0, 
U  =  XX'  +  YY'  +  ZZ'  . 


Except  for  roots  that  are  identically  zero,  the  roots  of  (2.5)  coincide 
with  the  nonzero  characteristic  roots  of  X'(U  -  YY’)  ^X.  Let 


(2.f>)  V  =  (X,Y,U)  , 

and  let 

(2.7)  M(V)  =  X'(U  -  YY')-1X. 

The  vector  of  ordered  characteristic  roots  of  M(V)  is  denoted  by 

(2.8)  (Aj, . . . ,Am) *  =  A(M(V))  , 


where  Aj  ^  i  A^  J  0.  Since  the  inclusion  of  zero  roots  (when  m  >  p) 
causes  no  trouble  in  the  sequel  we  assume  that  the  tests  depend  on  A(M(V)). 
It  is  well  known  that  these  tests  are  invariant  under  certain  groups  of 
transformations.  See  Anderson  (1958),  sec.  8. 10, or  Lehmann  (1959),  sec.  7.9. 

The  admissibility  of  these  tests  can  be  stated  in  terms  of  the  geometric 
characteristics  of  the  acceptance  regions.  Let 


(2.9) 


R<  -  ""I  *1  *X2  2  5  V*°>- 

r;  .  {X  e  R*|  J.iO . Xn  »  0}  . 
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Definition  2.1.  A  region  A  in  R® 


is  monotone  if  A  e  A, 


v  e  R 


m 


v.  $ 
1 


V 


1 , . . . ,m,  imply  v  e  A. 


Definition  2.2.  For  A  C  R™  the  extended  region  A*  is  defined  by 

A*  =  ^(x7r(l)’-,,’x7r(m)^'  I  *  €  ^  • 

where  tt  ranges  over  all  permutations  of  (l,...,m). 

Now  we  can  state  the  following  theorem: 

Theorem  2.1.  If  the  region  A  in  R^  is  monotone  and  if  the  extended 
region  A*  is  closed  and  convex,  then  A  is  the  acceptance  region  of  an 
admissible  test. 

This  is  the  essential  part  of  the  most  important  result  in  the  main 
theorem  of  Schwartz  (1967). 

Another  characterization  of  admissible  tests  is  given  in  terms  of 
majorization. 
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Definition  2.3. 


A  vector  X  =  (X ^ .... ,Xm) *  weakly  majorizes  a 
vector  v  =  (v, ,...,v  )'  if 

~  l  m 

(2.10)  Al1  j  *V[!jt  [2]  iV[l]+V[2]'-‘-*X[l]  +  ---+X[m]  *  V[l]  +  *  *  *  +V[m]  ' 

where  X^j  and  v[j]»  i  =  are  the  coordinates  rearranged  in 

nonascending  order. 

We  use  the  notation 


X  >>.  v  or  v  <  X 
~  'w  ~  ~  'w  ~ 


if  X  weakly  majorizes  v. 


Remark.  If  X.v  e  R™  then  X  v  is  simply 


xi  *  V  Vx2  3  vi  * v2 . V  ■••4X.  * 


If  the  last  inequality  in  (2.10)  is  replaced  by  an  equality  we  say 
simply  that  X  majorizes  v  and  denote  this  by 


(2.11) 


X  >  V  or  v  <  X  . 


The  theory  of  majorization  and  the  related  inequalities  are  developed 
in  detail  in  Marshall  and  Olkin  (1979). 


Definition  2,4.  A  region  A  in  R™  is  said  to  be  monotone  in 
majorization  if  X  f  A,  v  e  R®  ,  v  ^  X  imply  v  e  A. 
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I 


Figure  2.2 

Theorem  2.2.  If  a  region  A  in  R™  is  closed,  convex,  and  monotone 
in  majorization,  then  A  is  the  acceptance  region  of  an  admissible  test. 

Theorem  2.1  and  Theorem  2.2  are  equivalent.  It  will  be  seen  in 
Section  4  that  Theorem  2.2  can  be  more  conveniently  proved.  Then  an 
argument  about  the  extreme  points  of  a  certain  convex  set  (Lemma  4.12) 
establishes  the  equivalence  of  the  two  theorems. 

Application  of  the  theory  of  Schur-convex  functions  yield  several 
corollaries  to  Theorem  2.2. 

Corollary  2.1.  Let  g  be  continuous,  nondecreasing,  and  convex 
in  [0,1).  Let 

(2-12)  f(A)  =  f(XA . Am)  =  f  g(Ai)  . 

Then  a  test  with  the  acceptance  region 

A  =  (A |  f (A)  s  c} 

is  admissible. 
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A  proof  of  this  is  given  in  Section  4.  Setting  g(A)  =  -  log(l-A), 
g(A)  =  A/(l-A),  g(A)  =  A,  respectively,  shows  that  Wilks’  likelihood 
ratio  test,  the  Lawley-Hotelling  trace  test,  and  the  Bartlett-Nanda-Pillai 
test  are  admissible.  Admissibility  of  Roy's  maximum  root  test 

A:  A1  $  c 

can  be  proved  directly  from  Theorem  2.1  or  Theorem  2.2. 

We  note  that  (2.12)  is  a  special  case  of  a  Schur-convex  function.  A 
function  <J>  is  called  Schur-convex  if  x  -<  y  implies  4>(x)  S  <j>( y) .  The 
following  facts  are  well  known  in  the  theory  of  Schur-convex  functions. 

(See  Marshall  and  Olkin  (1979),  Chap.  3.)  If  <p  is  increasing  in  each 
argument  and  Schur-convex  then  x  «<w  y  implies  cf>(x)  £  'My)*  If  4>  is 
symmetric  and  quasi-convex  (that  is,  (x|  4> (x)  £  c)  is  convex  for  all  c) , 
then  ip  is  Schur-convex.  Combining  these  facts  we  obtain  the  following 
corollary. 


Corollary  2.2.  If  f  is  lower  semicontinuous,  increasing  in  each  argu¬ 
ment,  symmetric,  and  quasi-convex,  then  a  test  with  the  acceptance  region 
A  =  (A |  f (A)  $  c)  is  admissible. 


Corollary  2.2  is  identical  to  the  essential  part  of  Theorem  1  of 
Schwartz  (1967).  It  is  interesting  to  see  that  he  arrived  at  these 
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conditions  on  f  without  the  notion  of  Schur-convexity .  From  our  view¬ 
point  the  meaning  of  the  conditions  are  clearer  here.  Note  that  Corollary 
2.2  is  equivalent  to  Theorem  2.1. 

The  rest  of  the  paper  will  be  devoted  to  the  proof  of  the  above 
theorems . 


3.  Stein's  Theorem  and  the  Exponential  Family. 

In  this  section  we  state  Stein's  theorem  and  show  how  our  problem 
fits  its  setting. 

An  exponential  family  of  distributions  consists  of  a  finite-dimensional 
Euclidean  space  a  measure  m  on  the  o-algebra  of  all  ordinary 

Borel  sets  of  }/-,  a  subset  SI  of  the  adjoint  space  (the  linear 

space  of  all  real -valued  linear  functions  on  j^)  such  that 


(3.1) 


<Kw)  = 


JS'l 


¥ 


dm(y)  <  «o 


w  f  SI  , 


and  P,  the  function  on  fi  to  the  set  of  probability  measures  on  2$ 
given  by 


P  (A) 
0) 


I 

<Ku>) 


w'y  ,  ,  . 
e~  ~  dm(y). 


A  e  ^  . 


Theorem  3.1  (Stein).  Let  (^m.Sl.P)  be  an  exponential  family  and 
S1Q  a  nonempty  proper  subset  of  fi.  Let  A  be  a  subset  of  y  such  that 
(i)  A  is  closed  and  convex  and  (ii)  for  every  vector  w  e  and  real 
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c  for  which  (y |  oo'y  >  c}  and  A  are  disjoint,  there  exists  w  e  ft 
such  that  for  arbitrarily  large  \  +  Xoj  e  Q  -  fig.  Then  the  test 

with  acceptance  region  A  is  admissible  for  testing  the  hypothesis 
id  c  Qjj  against  the  alternative  id  e!l  -  !)jj, 

Note  that  A  need  not  be  closed  if  the  boundary  of  A  has  m-measure 
zero.  (See  Proposition  1  in  Appendix.) 

We  rewrite  the  distribution  of  (X,Y,Z)  in  an  exponential  form.  Let 
U  =  XX'  +  YY'  +  ZZ'  =  (u^)  and  E_1  =  (o1J).  For  a  general  matrix 
C  =  (c  ...,ck)  let  Vec(C)  =  (c* ....  *c^.) ' .  The  density  of  (X,Y,Z)  can 
be  written  as 

(3.2)  f(X,Y,Z)  =  K(M,N,£)  exp{tr  M' E_1X  +  tr  N' Z_1Y  -  ~  tr  £_1U} 

=  K(M,N,Z)  expfwJjjXjjj  ♦«'(2)y(2)  +  ^(3)^(3) }  * 

where 

exp{-  j  tr  Z_1(MM'  +  NN')} 

K(M,N,Z)  =  - j - - -  l  - -  > 

p(m+r+n)  4(m+r+n) 

(2w)2  |l| 

(3.3)  Ol.j,  =  vec(r'2H) ,  =  vec(Z  * 

ij(J)  -  -  j  (.“V2 . 2o1P,o22 . o»)'. 

y(1)  =  vec(x) ,  y(2)  =  vec(Y)  , 

3)  =  (ull'u12,-"’Ulp,U22’"',Upp)  ‘ 
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If  we  denote  the  mapping  (X,Y,Z)— ■ >y  =  (y^  >Y  [2)  *>^3))  hY  &  * 
(?-4j  y  =  g(X.Y.Z)  , 


then  the  measure  of  a  set  A  in  the  space  of  y  is 

(3.5)  m(A)  =  p (g~ 1 (A) )  , 

where  p  is  the  ordinary  Lebesgue  measure  on  gP(m+r+n)_  note  that 
(X,Y,U)  is  a  sufficient  statistic  and  so  is  y  =  (y|j ^ ^(2) ’^(3) ^ ' ’ 

Because  a  test  which  is  admissible  with  respect  to  the  class  of  tests 
based  on  a  sufficient  statistic  is  admissible  in  the  whole  class  of  tests, 
we  consider  only  tests  based  on  a  sufficient  statistic.  Then  the  acceptance 
regions  of  these  tests  are  subsets  in  the  space  of  y.  The  density  of 
y  given  by  the  right  hand  side  of  (3.2)  is  of  the  form  of  the  exponential 
family  and  therefore  we  can  apply  Stein's  theorem.  Furthermore,  since  the 
transformation  (X,Y,U)— >y  is  linear,  we  prove  the  convexity  of  an  acceptance 
region  in  terms  of  (X,Y,U).  The  acceptance  region  of  an  invariant  test  is 
given  in  terms  of  A(M(V))  =  (A^ , . . . , A^) ' .  Therefore,  in  order  to  prove  the 
admissibility  of  these  tests  we  have  to  check  that  the  inverse  image  of  A 

{V|  A (M(V) )  e  A} 

satisfies  the  conditions  of  Stein's  theorem.  We  carry  this  out  in  the  next 
section. 

For  the  sequel  we  use  the  following  notation: 

A  >,  B  if  and  only  if  A-B  is  positive  semidefinite, 

A  >  B  if  and  only  if  A-B  is  positive  definite  . 
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4.  Proof s. 


We  start  with  lemmas  concerning  matrix  inequalities. 

Lemma  4.1. 

(4.1)  PU1  +  qU2  -  (pYj  +  qY2)(pYj  +  q^)' 

>.  pcyx  -  y^y|)  ♦  qcy2  -  i2r2) . 

Proof.  The  left-hand  side  minus  the  right-hand  side  is 

(4 . 2)  pYxY;  +  q^Y’  -  p2^;  -  q2Y2Y2  -  pqfY^’  +  Y^J) 

■  Pd-PJXiXj  Md-qJY^  -  PqtY^  +  Y2Yj) 

=  pq(Y1-Y2) CYx-Y2) *  *  0  . 


Lemma  4.2.  If  A  >  B  >  0,  then  A-1  $  B-1  . 


Q.E.D. 


Proof.  Let  A  =  FDF'  and  B  =  FF'  where  D  is  diagonal  and  F 
is  nonsingular.  Then  A  ^  B  implies  D  *  I,  and  B_1  -  A-1  =  (F'  )-1  (I-D-1)F_1  >,  0 
because  I  -  D  £  0.  Q.E.D. 


Lemma  4.3.  If  A  >  0  ,  then  f(x,A)  =  x'A  ^"x  is  convex  in  (x,A)  . 


Proof.  If  A  =  D  is  diagonal,  then 


f(x,D)  =  x'D_1x  =  l  x?/d  . 
-  ~  ~  i=l 
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2 

is  a  convex  function  because  it  is  the  sum  of  convex  functions  x./d... 

1  11 

2 

(The  matrix  of  second-order  partial  derivatives  of  x  /d  is  positive 
semidefinite. )  In  general,  we  can  write 

A  =  F  D  F'  ,  A2  =  FF'  . 

Hence 

(4.3)  (p^  +  q^'CpAj  +  q^2)1(pil  +  q^ 

-  *-pF  ^i  +  qp  ^2^  *-p?  +  qI)  1  (pp  1}Ci  +  qp  ^x2^ 

*  p(F“1x1),D_1  ♦  q(F_1x2)  ’  (F_1x2) 

*  - 1  t  j 

=  p?i^i  *l  +  q$2~2  ~2  ‘  Q.E. 


Lemma  4.4.  If  A,>0,  A,,  >  0,  then 

—•I  -  %*.  ~ 


(4.4) 


(p5i  +  q?2^'(p~i  +  qA2)_1 Cpfi  +  q?2) 

*  *  q?2~21?2  * 


Proof.  From  Lemma  4.4,  we  have  for  all  y 
py'BjA^Bjy  +  qy’BjA^B^  -  y'^Bj.  +  q?2),(p6l  +  q~2)_1(p?l  +  q®2}Z 

=  pC5ir3  *6i1(5i?)  +  q(?2>:),^21(?2>:) 

"  *-p?l^  +  q?2^  *  ^p~l  +  q^2^  1<P®iy  +  qB2^ 

*  0  . 
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Thus  the  matrix  of  the  quadratic  form  in  y  is  positive  semidefinite.  O.ll.U. 

The  relation  as  in  (4.4)  is  sometimes  called  "matrix  convexity".  (See 
Marshall  and  Olkin  (1979).) 

Theorem  4.1. 

MCpV'i  ♦  qV,)  *  pMfVj)  +  qM(V2)  , 

where 

Vj  =  CXj.Yi.U)),  V2  =  (X2.Y2.U2,  , 

Uj  -  YaY;  >0,  U,  -  Y,Y’  >  0  , 

0  £:  p  =  1  -  q  i  1. 

Proof.  Lemma  4 . 1  and  Lemma  4 . 2  show  that 

,  -1 

(4.5)  [PjU  ♦  qy2  -  (pYj  +  ♦  qY2)  ] 

$  [p(Uj  -  YjYj)  +  m(U2  -  Y^)]'1  • 

This  implies 

(4.6)  M(pV1  ♦  qV2)  *  (pXj  ♦  qX^’lpfUj  -  Y^J)  ♦  q(U2  -  Y^)]"1^  ♦  qX2)  . 

Then  Lemma  4.4  implies  that  the  right-hand  side  of  (4.6)  is  less  than  or 
equal  to 
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Q.E.D. 


-1, 


PX1  (Uj  -  YjYj)  1X1  *  qX^(U2  -  Y^)"1^  =  P*^)  +  qM(V,)  . 


For  the  next  step  we  need  several  lemmas  concerning  major ization. 
Proofs  are  given  here  so  that  the  paper  is  self-contained.  For  a  full 
discussion  see  Marshall  and  Olkin  (1979). 


Lemma  4,5. 

k 

(4.7)  l  A.  (A)  =  max  tr  R'AR  . 

i=l  1  ~  R’R  =  I. 

~  -  ~k 


Proof. 

(4.8)  max  tr  R'AR  =  max  tr  Q'DQ  , 

?'R=ik  ~  ~~  Q'Q  =  Ik 

where  D  is  a  diagonal  matrix  with 

(d  ,d  )’  =  A (A)  . 

l I  mm  ~  ~ 

If  Q  is  the  appropriate  submatrix  of  the  identity  matrix,  equality  in 
(4.7)  is  achieved.  To  prove  the  inequality  we  augment  Q  to  a  full  ortho¬ 
gonal  matrix  G  and  let 

b  =  G'DG  =  (h  )  . 

Then 

trQ'DQ=  hn  +  ...+hkk. 
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Note  that 


b.  . 
JJ 


l 


i  =  1 


X .  g 

i6 


2 

i.i 


or 


where 


(b 


11’ 


•  ,b  ) 
mm 


PX  , 


P 


(PiJ) 


P  is  a  doublv  stochastic  matrix.  Then  (4.7)  follows  from  X  )>•  PX 
(Lemma  4.6) .  Q.E.D. 


Lemma  4.6.  If  P  is  an  m  x  m  doubly  stochastic  matrix,  then 


Proof. 


(4.9) 


k  m 


l  l  Pi  =  I  Vj  • 

i=l  j  =  i  -1  j  =  l  -1  J 


where 


g .  =  y  p. . 
J  i=i 


(0  *  g.  $  1,  l  g,  =  k)  • 
J  j»l  ^ 


Then 


ih  g7j  -  J,  ’’i  '  J,  V*  -  l  h  '  -  Jt  «!> 
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i  ovv'v1)  *i  <w*i 

i  =  J  i=k+l 


s  0  . 


Q.H. 


Remark.  Lemma  -4.0  holds  with  replacing  K 
-  w 


Lemma  4.7.  It'  At  B,  then  A  (A)  A(B)  . 

F  roo  f .  F  rom  Lemma  4  .  .A 

k  k 

(4.10)  ^  A  (A)  =  max  tr  R'AR  t  max  tr  R'BR  =  £  A.(B)  . 

i=l  1  R'R  -  1  '  R’R  =  |k  "  i =1  1 

Remark.  Actually  A  t  B  implies  A^(A)  <_  A^  (B)  ,  i  ■ 
but  this  stronger  result  is  not  needed. 


Lemma  4.8. 


A ( A ♦ B )  A (A  i  +  A (  B  )  . 


Proof . 

k 

(4.11)  J  A.(A+B)  =  max  tr  R’(A+B)R 

i  =  l  1  R’R  =  I, 

k 

f:  max  tr  R'AR  ♦  max  tr  R'BR 

R'R  =  1,  R'R  =  1. 

k  k 

k  k 

=  I  A  (A)  +  l  A  (B) 
i= 1  i=l 

k 

=  l  (A. (A)  ♦  i ..(B)}  .  Q.E.D 
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Remark.  Lemma  4.8  holds  with  replacing  J,  . 

Now  the  matrix  inequality  of  Theorem  4.1  translates  into  majorization 
of  vectors  of  characteristic  roots. 

Theorem  4.2. 

MNKpVj  +  qV2))  <w  pA(M(Vj))  ♦  qA (M(V2> )  . 

Proof.  Theorem  4.1  and  Lemma  4.7  imply 

(4.12)  MMCpVj  ♦  qV2) )  ACpMfVj)  ♦  qM(V2))  . 

Then  by  Lemma  4 . 8 

(4.13)  MpMtVj)  ♦  qM(V2))  pA(M(Vj))  ♦  qA(M(V2))  . 

From  (4.12)  and  (4.13)  Theorem  4.2  follows.  Q.F..D. 

Now  let  A  be  a  region  in  the  space  of  roots  and  let  A  be  the  inverse 
image  of  A  in  V  (»(X,Y,U))  space, 

A  -  (V[  A(M(V))  fc  A}  . 

We  want  to  show  that  A  is  convex  for  a  region  satisfying  the  condition  of 
Theorem  2.2.  Let  Vt  -  (X^Y^l^)  fc  A  ,  i  -  1,2  .  Then  AOKVj))  fc  A  , 
i  “  1,2,  and  by  convexity  of  A  we  have  pA(M(V^))  +  qA(M(V2))  fc  A.  Then 
by  Theorem  4.2  and  monotonicity  in  majorization  of  A 

AMpVj  ♦  qV2))  e  A  . 
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Hence  pV^  +  qV9  e  A  and  A  is  convex.  Furthermore  the  boundary  of  A 
has  probability  0.  (See  Propositions  1  and  2  in  Appendix.)  Therefore, 
condition  (i)  of  Stein's  theorem  is  satisfied. 


^(1)  =  X(M(Vj)) 
A(2)  -A(M(V2)) 


For  the  condition  (ii)  we  have  the  following  lemma.  The  proof  was 
suggested  by  Charles  Stein. 


Lemma  4.9.  For  the  acceptance  region  A  of  Theorem  2.1  or  Theorem 
2.2  the  condition  (ii)  of  Stein's  theorem  is  satisfied. 

Proof.  Let  to  correspond  to  ($>,')', 0)  then 


(4.14) 


=  ~U)?(D  +  ~ (2)~ (2)  +  ' (3)~ (3) 


=  tr  $'X  +  tr  f'Y  -  j  tr  OU  , 
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where  0  is  symmetric.  Suppose  that  (y|co'y  >  c}  is  disjoint  from  A  . 
We  want  to  show  that  in  this  case  0  is  positive  semidef inite.  If  this 
were  not  true,  then 


-i  o  o 

0  =  F  0  I  OF’, 

0  0  0 


where  F  is  nonsingular  and  -I  is  not  vacuous.  Let 


X  =  (1/Y)X0,  Y  =  (1/Y)Y0. 


yl  0  0 

U  =  (F')_1  0  1  0  F'1  , 

0  0  I 


V  =  (X.Y.U)  , 


where  XQ,  Y^  are  fixed  matrices  and  y  is  a  positive  number.  Then 


yl  0  0 

(4.15)  u)’y  =  ^  tr  <f’X0  +  ~  tr  4"Y0  ♦  j  tr  0  -1  0  >  c 

0  0  0 


for  sufficiently  large  y.  On  the  other  hand, 


(4.16)  X (M(V) )  = 


A{X' (U-YY' )_1X} 


J  yl  0  0 

“tH?o  (~')1  9  I  9  f1 

Y  |  0  0  1 


0  as  y  — >  oo. 


-7  lolo  h 


Therefore,  V  e  A  for  sufficiently  large  y.  This  is  a  contradiction. 
Hence  0  is  positive  semidefinite. 

Now  let  (jOj  correspond  to  ($^,0,1),  where  f  0.  Then  I  +  A0  is 

positive  definite  and  <t>  +  A<J>  ^  0  for  sufficiently  large  X.  Hence, 

(Oj  +  Aoj  e  £1  -  Qq 

for  sufficiently  large  X.  Q.E.D. 

Now  we  have  proved  Theorem  2.2. 

Proof  of  Corollary  2.1.  Being  a  sum  of  convex  functions  f  is  convex 
and  hence  A  is  convex.  A  is  closed  because  f  is  continuous.  We  want 
to  show  that  if  f(x)  g  c  and  y  x  (x,y  e  R™)  then  f(y)  $  c.  Let 

xk  =  Ei=l  xi>  yk  =  ^i=l  yi'  Then  y  \  *  if  and  only  if  xk  *  yk’ 
k  =  1 , .  • .  ,m.  Let  f(x)  =  h^,...,^)  =  g(xL)  +  gt^-x^).  It 

suffices  to  show  that  h(x^,...,xm)  is  increasing  in  each  xL  .  For 

i  £  m-1  the  convexity  of  g  implies  that 

h(x1,...,x.+e>...,xm)  -  h(x1,...,x.,...,xm) 

=  g(xi+e)  -  g(xi)  -  {g(xi+1)  -  g(xi+1-e)} 

*  0  . 

For  i  =  m  the  monotonicity  of  g  implies 

hfx^  . . .  ,xm+e)  -  Mxj, . . .  ,xm)  =  g(xm  +  e)  -  g(xm)  *  0  .  Q.E.D. 
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Lemma  4.10.  A  C  is  convex  and  monotone  in  majorization  if 
and  only  if  A  is  monotone  and  A*  is  convex. 


Proof. 

Necessity.  If  A  is  monotone  in  majorization, then  it  is  obviously 
monotone.  By  Proposition  3  of  Appendix  A*  is  convex. 

Sufficiency.  For  A  e  R™  let 

(4.17)  C (A)  =  (x|  xe  rJ,  x  ^  A}  , 

D(A)  =  (x|  x  e  R”,  x  A}. 


It  will  be  proved  in  Lemma  4.12  and  its  corollary  that  monotonicity 

of  A  and  convexity  of  A*  implies  C(A)  c  A*.  Then  D(A)  =  C(A)  n  R™  c 

A*  n  Rm  =  A.  Now  suppose  v  e  Rm  and  v  <  A.  Then  v  e  D(a)  c  A.  This 
^  ",  ^  w  ^ 

shows  that  A  is  monotone  in  majorization.  Furthermore,  if  A*  is  con¬ 
vex,  then  A  =  R™  n  A*  is  convex.  Q.E.D. 
x2 


Lemma  4.11.  Let  C  and  D  be  closed  and  convex, 
of  C  are  contained  in  D,  then  CCD. 


extreme  points 


If  the  extreme  points 
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Proof.  Obvious. 


Now  we  present  the  following  key  lemma. 


Lemma  4.12.  The  extreme  points  of  C(A)  are  all  vectors  of  the  form 


(4.18) 


(6Tr(l)XiT(l)'---’lSTT(m)XTT(m))  * 


where 


7r  is  a  permutation  of  (1 , . . .  ,m)  and  6  =  . . .  =  6^  =  1 , 


6,  .  =  ...  =  6  =0  for  some  k. 

k+1  m 


Proof.  C(A)  is  convex.  (See  Proposition  4  of  Appendix).  Now 
note  that  C(A)  is  permutation  symmetric,  that  is,  if  (x^,...,x  )'  e  C(A), 
then  (xTr(i)  >  •  •  •  ,XTr(m)^ '  €  f°r  an^  Permutati°n  ir.  Therefore,  for 

any  permutation  it,  tt(C(A))  =  { (x^j , . . .  .x^^  ) '  |  x  e  C(A)}  coincides 
with  C(A).  This  implies  that  if  (Xj,...,x  )'  is  an  extreme  point  of 
C (A) ,  then  (x^^  , . . .  ,x^m^)  '  is  also  an  extreme  point.  In  particular, 
(x^j ,. . .  ,Xjmj)  €  R™  is  an  extreme  point.  Conversely  if  (x^,...,xm)  e  R™ 
is  an  extreme  point  of  C(A),  then  (^(i) > •  •  •  »x7T(m)^ '  an  extreme  point. 

We  see  that  once  we  enumerate  the  extreme  points  of  C(A)  in  R™,  the 
rest  of  the  extreme  points  can  be  obtained  by  permutation. 

Suppose  x  e  R™.  An  extreme  point  being  the  intersection  of  m 
hyperplanes  has  to  satisfy  m  of  the  following  2m  equations: 


(4.19) 

V 

X1  = 

0  , 

V 

X1  =  X1  ’ 

V 

x2  = 

0  , 

F2: 

=  Ai+A2  , 

E  : 
m 

xm 

0  , 

Fm: 

x.  +  . . .  +  x  =  A.  +  . , 
1  ml 

.  •  ♦  Am  . 
m 

Suppose  that  k  is  the  first  index  such  that  holds  (namely,  suppose 

that  hold).  Then  x  £  implies 


0  =  x.  Z  x.  ,  Z  ...  %  x  =  0 
k  k+1  m 


Therefore,  E]c+i>--*»Em  hold.  This  gives  the  point  A  =  , . . . ,  X.  1,0,...,0) 

which  is  in  R™  n  C(A).  Therefore  X  is  an  extreme  point.  Q.E.D, 


Remark.  If  A.  >  A_  >  . . .  >  A  above,  the  total  number  of  distinct 
-  1  i  m 

extreme  points  is 


I  V  m~i -s  i  =  m!  I  r-r-  4  m!  e  . 

k=0  (m-k)!  k=0  k 


This  follows  from  the  fact  that  by  permutation  of  X  =  (X^ , . . . ,X^,0, . . . ,0) ' 
we  obtain  m!/(m-k)!  distinct  points. 

Mirsky  (1959)  gave  the  explicit  expression  of  the  extreme  points  in 
the  form 


(4.20) 


(6.X  6  X  f  -,)  , 

v  1  tt(1)  ’  ’  m  ir(m)  ’ 


where  tt  is  a  permutation  and  6^'s  are  zero  or  one.  Actually  this  set 
of  points  include  some  points  that  are  not  extreme  points. 


Corollary  4.1. 


C(A)  C  A* 


Proof.  If  A  is  monotone,  then  A*  is  monotone  in  the  sense  that 

i=(A! . V’eA*.  V  =  CV1  Vi«Xi>  i  -  !  , . . .  ,m. 


then  v  e  A*.  (See  Proposition  S  of  Appendix.)  Now  the  extreme  points 
of  C(A)  given  by  (4.18)  are  in  A*  because  of  permutation  symmetry 
and  monotonicity  of  A*.  Hence  by  Lemma  4.11  C(A)  c  A*.  Q.E.D. 

Proof  of  Theorem  4.1.  Immediate  from  Theorem  4.2  and  Lemma  4.10.  Q.E.D. 
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We  prove  here  some  technical  propositions  to  complete  the  argument 
in  Section  4. 

Proposition  1.  The  condition  (i)  in  Stein's  theorem  can  be  replaced 
by  (i ' )  A  is  convex  and  the  boundary  of  A  has  m-measure  zero. 

Proof.  If  A  is  convex,  then  A  (=  closure  of  A)  is  convex. 
Furthermore, 

A  n  {y|  u)'y  >  c}  =  =»  A  c  {yj  oi’y  $  c} 

=*  A  c  {y  |  Od'y  £  c}  =»  A  n  {y|  w'y  >  c}  =  <}l 

Therefore  Stein's  theorem  holds  with  A  replaced  by  A.  Finally,  m(A-A) 
implies  that  tests  with  the  acceptance  regions  A  and  A  are  equivalent 
(for  any  u>  e  ft) . 

Proposition  2.  The  boundary  of  A  in  th'  proof  of  Theorem  4.11 
has  m-measure  zero. 

Proof.  We  claim  that 

closure  of  A  C  A  U  (V|  U-YY'  is  singular}  =  A  U  c  , 

where  C  =  (V |  U-YY'  is  singular}.  Obviously  m(C)  =  0.  This  implies 
m(C)  i  m(boundary  of  A )  =  0.  Now  suppose  V  =  (X,Y,U)  e  closure  of  A. 


Then  V  =  lim  V.  =  1 im(X. , Yi ,U.),  where  V.  e  A  or  A(M(V.))  e  A.  If 
U  -  YY'  is  singular,  V  e  C.  If  U  -  YY'  is  nonsingular, 

M(V)  =  X' (U-YY' )_1X 

=  lim  M(V.)  =  lim  X.fU.-Y.Y!)  X., 

A(M(V) )  =  lim  A(M(V.)) 

by  continuity.  Since  A  is  closed,  A(M(V))  e  A.  Then  V  e  A.  Q.E.D. 

Proposition  3.  If  A  C  R™  is  convex  and  monotone  in  majorization, 
then  A*  is  convex. 


Proof.  Suppose  x,y  e  A*.  For  a  vector  z  let  z  denote 
2+  =  (®[1] . '[m]5'  €  R<-  ‘N0W 

(px+qy)^  px+  ♦  qy+ 

because 

max(px.+qy.)  $  p  max  x.  +  q  max  y. 
i  i  i 

max  {(px.+qy.)  +  (px  +qy  ) } 

(i.j)  j  j 

S  p  max  (x.+x.)  +  q  max  (y.+y  )  , 

(i.j)  J  (i.j)  J 


etc.  Hence, 


(px+qy)  e  A  and  px  +  qyeA*.  Q.E.D. 


2  6 


Proposition  4.  C(X)  is  convex. 

Proof.  Let  x,y  X.  Then  x^,  y+  X.  As  in  the  proof  of 
Proposition  3  we  have 

(P?+<^  +  p?l  +  • 

Now  px+  +  qy^  pA  +  qA  =  X.  Hence 

(px+qy)+  X  , 

and 

p; +  qy  x  . 

It  is  obvious  that  if  x,y  e  R™,  then  px  +  qy  e  R™.  Q.E.D. 

Proposition  5.  If  A  is  monotone,  then  A*  is  monotone. 


Proof.  Let  y  e  A*  and  for  x  assume  x^  £  y^,  i  =  1 . m.  The 

point  of  this  proposition  is  that  the  permutations  transforming  x  -*•  x,, 
y  y ,  might  be  different.  Note  the  relation 

xm  =  max  {min(x.  ,...,x,  )}. 

(ij» • • • i 1  ^ 


Then  for  any  (ij.---.ijj) 


minCx.  ,...,x,  )  $  min(y.  ,...,y.  ) 
1  Xk  X1  \ 


£  max  {mxn(y.  ,...,y.  )}  =  y 
(jj . Jk)  k 


w 
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"ence  x[k]  *  y[k]> 


k  =  This  with  e  A  implies  x+  e  A. 


Hence  x  e  A*. 
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